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"^ , Assuming that a stochastic process X — {Xt)t>o is a sum of a 

2 ' compound Poisson process Y — {Yt)t>o with known intensity A and 

unknown jump size density /, and an independent Brownian motion 

Z = (Zt)t>Q, we consider the problem of nonparametric estimation of 

/ from low frequency observations from X. The estimator of / is con- 

^ ' structed via Fourier inversion and kernel smoothing. Our main result 

0^ ! deals with asymptotic normality of the proposed estimator at a fixed 

^~^ • point. 
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1 Introduction 



Let Y = {Yt)t>o be a compound Poisson process with intensity A and jump 
size distribution F, which has a density /. Assume that Z = (Zt)t>o is 
a Brownian motion independent of Y and consider the stochastic process 
Xt = Yt + Zt- Notice that X = (X()i>o is a Levy process. Suppose that 
X is observed at equidistant time points A, 2A, . . . , nA. By a rescahng 
argument, without loss of generahty, we may take A = 1. Given a sam- 
ple Xi,X2, . . . ,X„, the statistical problem we consider is nonparametric 
estimation of the density /. Notice that the Levy triplet of the process 
X is giv e n by (0,1, i^), where the Levy measure v[dx) equals Xf{x)dx, 



see 



Satd ( 20041 . Example 8.5). Since the Levy triplet prov i des a unique 



means for characterisation of any Levy process, see e.g. ISatol (|200J, Chap- 
ter 2), inference on the law of X can be reduced to inference on u. Most 
of the existing literature dealing with estimation problems for Levy pro- 
cesses is concerned with par amet ric estinia,tion of the Levy measure, see e.g. 
Akritas and JohnsonI ( 19811 ) and lAkritasI ( 19821 ). where a fairly general set- 
ting is considered. There are relatively few papers that study nonparametric 
inference procedures for Levy processes, and the majority of them assume 
that high frequency data are available, i.e. either a Levy process is observed 
continuously over a time interval [0, T] with T ^ oo, or it is observed at 
equidistant time points A„,, . . . , nAn. a nd lim„,_>fy, A„, = 0, lirn n. ^nn n A„ = 
oo, see e.g. Rubin and Tucker ( 19591 ). iBasawa and Brockwelll ( 19821 ) and 
Figueroa-Lopez and Houdrg (J2004l ). On the other hand, high frequency 
data are not always available and it is interesting to study estimation prob- 
lems for this case as well. In the particular c ontex t of a comp ound Poisson 



process we mention iBuchmann and Griibell ( 20031 . 12004 ) and Ivan Es et al. 



( 2007al ). where given a sample Yi,...,Yn from a compound Poisson pro- 
cess Y = {Yt)t>o, n onparametric estimator s for t he ju mp size distribu- 



tion f unction F (se e Buchmann and Griibell (J2003l . 12004 )) and its density 



/ (see Ivan Es et al.l (J2007al )) are proposed and their asymptotics are studied 
as n ^ oo. This problem is referred to as decompounding. The process 
Xt = Yt + Zt constitutes a generalisation of the com pound Poisson mode l 
considered in iBuchmann and Griibell (J2003l . 12004 ) and Ivan Es et al.l (l2007al) 
and is related to Merton's jump-diffusion model of an asset price, see lMerton 
( 197(3 ). Since Z is a Brownian motion, it is natural to call the estimation 
problem of / decompounding under Gaussian noise. Figures [THl] provide an 



indication of the difficulty of the problem. Figure 1 on the following page| 
gives a typical path of the Brownian motion, while Figure [2] gives a path 
of the process X. The difference is at once clear when X is observed con- 
tinuously. If this is the case, then one can see all the jumps in the path of 
X and the problem of estimating / is relatively easy, as no decompounding 



is involved. On the other hand Figures E] and 4 on the next page provide 



discretised versions of the typical paths of the Brownian motion Z and the 
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Figure 1: A typical path of the Figure 2: A typical path of the pro- 
Brownian motion. cess X = {Xt)t>Q- 

process X. In this case both plots look similar and given the highly irreg- 



Figure 3: A discretised path of the Figure 4: A discretised path of the 
Brownian motion. process X = {Xt)t>o- 

ular character of Brownian paths, it is difficult to conclude at which time 
instances jumps occur in the process X. The information on / is contained 
in the jumps and the impossibility to observe them makes the problem of 
estimation of / much more difficult. 

Nonparametric estimation of the Levy measure of a more general Levy 
process than X based on lo w fr equency observations was considered in 
Watteel and Kulperged ( 20031 ) and lNeumann and Reifil ( 20071 ). However these 
authors treat the case of estim ation of the Levy meas u re on ly (or of the 
canonical function K in case of IWatteel and Kulpergeii ( 20031 )) and not of 
its density. Moreover, they study the proposed estimators under the strong 
moment condition E[|Xi|^^ ] < oo, where 5 is some strictly positive num- 
ber. This condition automatically excludes distributions with heavy tails. 
We refer to those papers for additional details. 

Using the stationary independent increments property of a Levy process, 
we see that the problem of estimating / from a discrete time sample from 
X is equivalent to the following: let Xi, . . . , X„ be i.i.d. observations, where 
Xi = Yi + Zi, and Yi and Zj are independent. Assume that the unobservable 



y's are distributed as a random variable 



Y 



N{\) 



J' 



j=i 



where A'^(A) has a Poisson distribution with parameter A and where the Ws 
are i.i.d. with distribution function F and density / and where by convention 
a sum over the empty set is understood to be zero. Thus we assume that 
y is a Poisson sum of i.i.d. VF's. Furthermore, let the random variables 
Zi have a standard normal distribution. Assume that A is known. The 
estimation problem is as follows: based on the sample Xi, . . . , X„, construct 
an estimator of /. 

In this context one might also think of the X's as of measurements of 
the realisations Y^s of some quantity of interest, which are corrupted by the 
noise Z. This way we are in the classical 'signal' plus 'noise' setting and 
the problem at hand is then related to the deconvolution problem, see e.g. 
Wand and JonesI ( 19951 ) for an overview, and in particular to i ts gene ralisa- 



van 



Es et al.l (|2007bl ^. 



tion to the case of an atomic deconvolution, see 

The method that will be used to construct an estimator for / is based 
on Fourier inversion and is similar in spirit to the us e of kernel est i mators in 



van 



Es et al.l (|2nn7al lbl;i. 



deconvolution problems, as well as our approach in 

Let (j)x 1 i'Y 1 <Pz and 0j denote the characteristic functions of X, y, Z, 
and W, respectively. Then by independence of Y and Z and by the fact that 



4>Y{t) 



-X+X(t>fit) 



see e.g. ISatd (|2004l . Chapter 1, Section 4), we have 



cpxit) = <pY{t)<Pz{t) = e-^+^<^/We-*'/^ 



and therefore 



=A0/(t) 



(Pxit) 



e~^e 



Notice that P{Y = 0) = e ^. Inverting 



1 



(Pfit) = ^Log 



, we get 



(1) 
(2) 



= -A, 



.e-tV2 
Here Log denotes the distinguished logarithm, called so due to the similar- 



ity to the distinguished lo garithm as co nstructed e.g. in lChow and Teicher 



1978, Lem ma 1, p. 413 ), ichunel (J2nnil . Theorem 7.6.2). IPinkelestein et a" 
( 19971 ) and ISatd ( 20041 . Lemma 7.6). The difference is that in our case 
the function exp(A(/)/(t)) equals e at t = and not 1. The distinguished 
logarithm of exp(X(/)f{t)) in our case can be defined as 



A + Loe 



<t>x{t) 



(3) 



where Log denotes the distinguished logarithm as constructed e.g. in lChungI 
(|200ll . Theorem 7.6.2), or it can be constructed directly. 

Remark 1.1. Notice that in general the distinguished logarithm of the non- 
vanishing characteristic function (f>(t) cannot be reduced to the composition 
of the principal branch of an ordinary logarithm log with (f). Consider the 
following trivial exa mple: (t> (t) = e**. This characteristic function satisfies 
the requirements of IChuna (J200ll . Theorem 7.6.2), since it takes its values 



on the unit circle in the complex plane and hence its distinguished logarithm 
exists and is given by Log((p{t)) = it. On the other hand if one considers 
the argument of log(0(t)), it is easy to see that it jumps whenever (p crosses 
the negative real axis, see Figure [5] and compare to the argument of the 
distinguished logarithm. This fact is not surprising, given that —1 lies on 
the branch cut of the principal branch of an ordinary logarithm. 
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Figure 5: Arguments of the principal branch of a logarithm and of the 
distinguished logarithm. 



Remark 1.2. Notice that if A < log 2, the distinguished logarithm in ([3]) 
reduces to a composition of the principal branch of an ordinary logarithm 
with exp(A0/(t)). This follows from the fact that 



Log 



e^H(t)\ 



where (j)g{t) = (j)- 



Y\N>0 



(i), cf. 



van 



Es et al.l ( 2007al ). It is immediately seen 



that the condition A < log 2 will then prevent exp(X(pf{t)) from taking values 
on the negative real axis, which constitutes the branch cut for the principal 
branch of an ordinary logarithm. 

Assuming that (j)f is integrable, by Fourier inversion we obtain 

1 r _.... / ^x{t) 



/(^) 



27rA 



-itx 



Log 



-Xp-t^/2 



dt. 



This expression will be used as the basis for construction of an estimator 
of /. Let (j)emp denote the empirical characteristic function of the sample 
-'^l; • • • ^Xn, 



1 



n ^-^ 

Furthermore, let if be a symmetric kernel with Fourier transform (^^ sup- 
ported on [—1,1] and nonzero there, and let /i > be a bandwidth. The 
density q oi X can then be estimated by a kernel density estimator 

Its characteristic function 4>q^,^{t) = 4>empit)4'w{ht) will serve as an estimator 
of (f)x{t)- For those w's from the sample space 0, for which the distinguished 
logarithm in the integral below is well-defined, / can be estimated by the 
following plug-in type estimator, 

;„,(,)'/''%-... Log f%i%MV, (4, 



while for those w's, for which the distinguished logarithm cannot be defined, 
we can assign an arbitrary value to fnhix), e.g. zero. The distinguished loga- 
rithm in (jll) can be defined only for those w's for which (f>emp{t)(t>w{ht)e^e* '^ 
as a function of t does not vanish on [—l/h,l/h]. In fact in Section [2] we 
will prove that as n ^ oo, the probability of the exceptional set where the 
distinguished logarithm is undefined, tends to zero. For technical reasons 
which will become apparent in the proofs, we also need to truncate fnhix), 
and consequently, we define the estimator of f{x) not by the expression 
above, but by 

fnh{x) = (M„ A fnh{x)) V i-Mn), (5) 

where M = {Mn)n>i denotes a sequence of positive numbers converging to 
infinity at a suitable rate to be specified below. 

Concluding this section, we state conditions on the density /, the band- 
width h and the truncating sequence M that will be used in Section [2j 

Condition 1.1. Let the density f be such that (pf is integrable. 

Condition 1.2. Let the kernel w be the sine kernel, w{x) = (sinx)/7r2;. 

The Fourier transform of the sine kernel is given by (pwit) = l[-i,i](i)- 
The sine kernel has been used successfu lly in kernel density estimation since 
a long time, see e.g. IPavid (jl975l . 119771 ). It is the simplest example among 



the so-called superkernels, i.e. kernels the Fourier transforms of which are 
identically 1 in some open neighbourhood of zero. For more information 

6 



on the l a tter c las s of kern e ls we refer e.g. to iDevroye and Gyorfil (Il985l ). 
Devrovd ( 19881 ) or iDevrovd ( 19921 ) . An attractive feature of the sine kernel 
in ordinary kernel density estimation is that it is asymptotically optimal 
when one selects the mean square error or the mean integrated square error 
as the criterion for the performance of an estimator. Notice that the sine 
kernel is not Lebesgue integrable, but its square is. 

Condition 1.3. Let the bandwidth hn depend on n and he such that /i„, ~ 
(logn)^^, where /5 < 1/2. 

Notice that this condition implies ne~^' " -^ oo. In the sequel we will 
suppress the subscript used to demonstrate the dependence of h on n, since 
no ambiguity will arise. 

Condition 1.4. Let the truncating sequence M = (M„)„>i he such that 
Mn = Clogn, where C > is some constant. 

The rest of the paper is organised as follows: in Section [2] we show that 
with probability approaching 1 as n — > oo, the distinguished logarithm in 
([3]) is well-defined and subsequently we state the main result of the paper 
concerning the asymptotic normality of fnh at a fixed point x. The section is 
concluded with a brief discussion of the obtained results. Section [3] contains 
a simulation example. All the proofs are collected in Section HI 



2 Main result 



We first establish that with probability tending to 1 as n — > oo, the dis- 
tinguished logarithm in (j4]) is well-defined. Thus our goal is to find a set 
Bnh, such that on this set the distinguished logarithm might be undefined, 
while on the set -B^^ it is well-defined. Fix iv from the sample space and 
consider the quantity 



sup 



■'emp 



it) 



4>x{t) 



-Ag-t2/2 



-Ag-t2/2 



(6) 



Now suppose that there exists a small number 5, such that 



sup e 



l/(2/.2) 



(pempit) (pxit) 



<6. 



Obviously this implies that © is less than 6. If 6 is small enough, then since 
4>x{t)e^e'^ '^ = exp[A0/(t)] is bounded away from zero, also (pemp{t)e^e^ '^ 
will be bounded away from zero on [— l//i, l//i]. From this it follows that 
on this interval one can define the distinguished logarithm of (f>emp{t)e 
This simple observation shows that on the set 



=AgtV2 



Bnh 



UJ 



sup 



,1/(2^2) 



'^emp 



(t) 



OX 



(t) 



<6 



the distinguished logarithm will be well-defined for 6 sufficiently small. Thus, 
what remains to be done is to prove that the probability of the complement 
of this set converges to zero as n ^ oo. To this end we will make use of the 



following theorem from iDevrovd (|1994| ). 



Theorem 2.1. Let X be a random variable with characteristic function (p 
and finite first moment, and let (pn be the empirical characteristic function 
of the i.i.d. sample Xi, . . . , X^ drawn from X. Then, for a and /?, possibly 
dependent upon n, 



P snp\Mt)-Hi)\ >/9 
Vl*l<" y 



<4 1 + 



8aE[|X| 



-nfi/72 



+ 0(1), (7) 



where the o(l) term is uniform over all a and (3. 



Remark 2.1. In our results we need additional information on the o(l) 
term in ([7]). It follows from the proof of Theorem 12.11 that it is bounded by 



1 " 



i=i 



>^n\xi\ 



(8) 



see lDevrovd (| 19941 ). Since the X's are not b ounded, it is not possible to apply 
Hoeff ding's inequality, see iHoeffdingI ( 19631 ). to show that this probability is 
exponentially small. At the same time, verification of the moment conditions 
needed for Bernstein's inequality to hold is difficult in our case and might 
require strong conditions on Y. Therefore we opt for an unsophisticated 
application of Chebyshev's inequality to bound this probability. 

The following proposition follows from Theorem 12.11 

Proposition 2.1. Assume Conditions \1.2\ and \1.3\ and let E [|X|] < oo. 
Then the distinguished logarithm in ^ is well-defined with probability tend- 
ing to 1 as n ^ oo. Moreover, i/E [|X|''] < oo for 1 < p < 2, then 



F{B, 



nh) 



o 



1 



nP' 



and if E [\X\P] < oo for p > 2, then 



P{Bnh) = O 



^p/2 



The main result of the paper concerns the asymptotic normality of fnhix) 
at a fixed point x. The following theorem holds true. 



Theorem 2.2. Suppose that A is known. Let the estimator fnhix) be de- 
fined as in ([5]), and assume that Conditions I J.ihfTTT] hold. Furthermore, let 
\x\Pf{x) be integrable with p > 3/2. Then 

jJ^iUx) - E[/„,(x)]) ^ N (o, ^) 

as n ^ oo. 

Rema rk 2.2. Not ice that the integrabihty of \x\''f{x) implies E [\X\p] < cxd, 



see e.g. ISatd (|2004l . Corollary 25.8). Thus the conditions of the Theorem 



cover a large class of distributions with heavy tails. 

Remark 2.3. From Theorem 12.21 it follows, that in order to get a consis- 
tent estimator, nh~^e~^'^ has to diverge to infinity. This means that the 
bandwidth h has to be fairly large, i.e. of order (logn)~^, where /3 < 1/2, 
thus resulting in a slow, logarithmic rate of convergence of fnhix). This is 
in sharp contrast with the ordinary de comp ounding case, w here the conver- 
gence rate is polynomial, see Section Ivan Es et al.l (l2007al ) . On the other 



hand, the convergence rate of fnh{x) is similar to that in the ordinary de- 
convolution, as well as the deconvolution for an atomic dis tribution, when 
the error dist r ibution is assumed to be supersmooth, see e.g. iFanI ( 199ll ) and 



van Es et al.l (J2007bl ). This fact should not come as a surprise, due to the 



similar structure of these problems and the presence of Gaussian noise in 



our m odel. We also mention that in a recent preprint iNeumann and Reifj 



(J2007l ). under some conditions on the Levy measure u, obtained similar log- 
arithmic lower bounds for estimation from low frequency observations of the 
Levy measure z^ of a general Levy process with a Brownian component. 



Remark 2.4. Using the estimator png from Ivan Es et al.l ( 2007bl ) . an esti- 



mator of A can be defined as A„g = — log png, of course provided that png is 
strictly positive. However the proof of Theorem 1 2. 2 1 for the case of unknown 
A is a highly nontrivial task. 

Apart of Theorem 12.21 it is also interesting to study the asymptotic 
distribution of 

J^^iUx) - fix)), (9) 

i.e. of the estimator fnhix) centred at the true density f{x). After rewriting 
the above expression as 



f,^rm^(fr.h{x) - fix)) = ^^^^^^^,^ (fnhix) - E [fnhix)]) 

n 



we see that we have to study the behaviour of the bias of the estimator 
fnhix), which is given by E [fnhix)] — fix). It will turn out that the behaviour 



of the bias depends on the tail behaviour of the characteristic function of 
/. For our purposes it suffices to distinguish two cases: in the first case we 
will assume that 0/(t) = 0(e~'*' ) with 1 < a < 2, and in the second case 
we will assume that (pf{t) = 0{\t\~^) as t ^ oo with 7 > 1. These two 
cases find a parallel in deconvolution problems, where a distinction is made 
between the use of supersmoo th or ordinary smooth distributions to model 
the error distribution, see e.g. iFanI ( 199ll ). 



Proposition 2.2. Suppose that A is known. Let the estimator fnh{x) he 
defined as in ([5]) , and assume that Conditions \1.1[]1.4\ hold. Furthermore, 
let f have a finite pth moment, p > 1. 

(i) If 4>f{t) = 0(e~l*l°) as \t\ -^ 00 for 1 < a < 2, then we have 

E[f^hix)]-f{x) = 0{h^-'e~'/^'') 

as n ^ cxD. 

(a) If (pfit) = 0{\t\-^) as \t\ -^ 00 for J > 1, then 



E[U{x)]-f{x) = 0{h^~'). 



as n ^ CXD. 



Remark 2.5. Despite the fact that the bias of fnh{x) asymptotically van- 
ishes, the consequence of Proposition 12.21 is that the asymptotic normality 
of ([9|) cannot be established for the symmetric stable densities. Of course, it 
cannot be established for other densities either, the characteristic functions 
of which decay algebraically. Examination of the proof of Proposition 12.21 
demonstrates that in order to have that ^ is asymptotically normal, one 
has to assume that, e.g., (/>/(t) = e~'*'° with a > 2. However if this is the 
case, then (p'JO) = 0'^(O) = and consequently the first two moments of / 
have to vanish. There does not exist a density with such properties. 

Remark 2.6. It appears that in our case the square bias dominates the 
variance of the estimator. This is not sur prising in view of similar results 
obtained in iButucea and Tsybakovl (J2004l ) for the ordinary deconvolution 



problem: suppose 

X = Y + Z, 

where Y and Z are such that 

/■oo 

|2 



|(/)y(t)rexp(2a|tf'') <27rL, 

bmin\tr eM-P\tn < ^Z{t) < brna.\tr' exp(-/5|tn. 

Here a,r, L,bmin,r,bmax, are strictly positive constants, 7 are 7 are real 
numbers, and it is assumed that r < s. Then the square bias of the decon- 
volution kernel density estimator, which is based on observations on X and 
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is evaluted for the sine kernel, dominates the variance. In our case Y does 
not even have a characteristic function which vanishes at plus and m inus 
infinity. The similarity to the model in iButucea and Tsybakovl ( 2004 ) also 



holds true when comparing 4>f and (j)z, as cpz, being the characteristic func- 
tion of a standard normal random variable, in a certain sense represents an 
extreme case among characteristic functions of supersmooth distributions. 

3 Simulation example 

Practical implementation of the estii nator (|^ is not a stra ightforward task. 
The idea we use is similar to that of Ivan Es et al.l ( 2007al ). Notice that we 



can rewrite (g]) as fnh{x) = /„! (a:) + /„^ (2;), where 

^"^ ^"^^ " 27rA io ^\e-^e-tyy ' 

Using the trapezoid rule and setting Vj = ri{j — 1), f^fl{x) can be approxi- 
mated by 

1 ^ 

Here we take N to be some power of 2 and ip is defined by 



Prom this point on one can proceed as in Ivan Es et al.l (J2007al ) and eval- 



uate (fTT]l for a set of appropriately selected points xi, . . . xn via the Past 
Pourier Transform. A similar reasoning applies to f^fiix). 

The general difficulty with implementing the estimator is the computa- 
tion of the distinguished logarithm, i.e. of function ip. A way to do this is to 
take a fine grid of points, evaluate the argument of the ordinary logarithm 
there and if one sees large jumps of size comparable to 2tt between two con- 
secutive points, make appropriate changes to the argument, thus obtaining 
an approximation to the argument of the distinguished logarithm. Of course 
this approach works only when (pempit) does not vanish on [— 1//1, 1/h]. The 
latter fact can be verified in theory only, while in practice this can be done 
only for a grid of points ti, ^2, . . . t^i which thus has to be taken rather fine, 
so that one does not possibly miss the value zero. 

Though our emphasis is more on theoretical aspects of decompounding 
under Gaussian noise, we nevertheless will consider one simulation example 
in this section. We took A = 1 and / the standard normal density and 
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simulated a sample of size n = 5000. The bandwidth h = 0.5 was selected 
by hand. The resulting estimate fnh (bold dotted line) together with the 
true density / (dashed line) is plotted in Figure El We notice that the fit is 
quite good. Furthermore, notice that 



P(A^(A) > 2) = 1 - 2e" 



0.264. 



It turns out that we considered a nontrivial example, since a considerable 
number among the 1^'s are sums of the Wj's in this case. 

We should stress the fact that this simulation example serves as an illus- 
tration only and an extensive simulation study is needed to investigate the 
finite sample performance of our estimator and its behaviour in practice. 
We have to be very careful when generalising our conclusions concerning 
this simulation example because of the fact that the empirical characteris- 
tic function is oscillatory in its tails. If the integration step size t] is not 
small enough, we might miss instances when (pemp crosses the negative real 
axis. This will have direct consequences for the argument of the distin- 
guished logarithm. This is especially true for relatively small sample sizes, 
for which the empirical characteristic function (pemp might not approximate 
the true characteristic function (px well enough. The issue of selection of 
T] in practice remains open and a thorough simulation study is needed to 
obtain some practical recommendations how this can be done. Additionally, 
a data-dependent method of the bandwidth selection has to be created. 



"■1^ 

/;•;, 



Figure 6: Estimation of the normal 
density, n = 5000. 



4 Proofs 

Proof of Corollary \2.1{ Note that we have 
V{Br,h) < 4 1 + _^,J , 1 exp 



e-^5 



h 



72 



+ o(l), 



12 



where we assume that 5 is small enough. This bound follows from Theo- 
rem l2.1l with a = 1/h and (3 = e~'^6e~^'^'^^ '. The right-hand side converges 
to zero as n ^ oo due to Condition 11.31 To prove the corollary, the only 
additional fact that we need to verify is tha t the o(l ) term from Theorem 
12.11 which in the proof of Theorem 12.11 from IPevroyd ([l994) is bounded by 
([8]), is of order n^^P, if 1 < p < 2, and is of order n~f''^, if p > 2. In fact, if 
the inequality 



1 






holds, then we have 



1 " 

-Y^X,-E[X,] 



n 



i=i 



> 



-> X, 



1 " 

n ^^ ■' 



>3E[|Xi|] 



|E[Xi 



>3E[|Xi| 



E[|Xi| 



gEOXil 



By Chebyshev's inequality this implies 



1 " 



i=i 



>f^[\Xi\ 



< P 




<3^(E[|Xi|])-^E 



Suppo se first that 1 < p < 2. Then it follows from Theorem 4 of lvon Bahr and Esseen 
(jl965l ) that the rig htmost term in (fT2]) is of order n^~ ^. Now suppose p > 2. 
Then Theorem 2 of lDharmadhikari and Jogded ( 19691 ) implies that the right- 
most term of (\12h is of order n~^''^ . For explicit constants we refer to the 
same papers. 

Assume again that 1 < p < 2. To complete the proof of the corollary, we 
have to verify that 



=1/(2/1^) 



-2Aa2 



6^ne-^l^ 



h 



■ exp 



72 



n 



p-i 



0. 



To this end we take the logarithm of the left-hand side to obtain 

-2Aj2 



i-^°^^-^^^ 



-l//l2 



+ (p- l)logn. 



(13) 



(14) 



The first term here is of order (logn)^^ and is negligible compared to logn. 
The second term is of order log log n and is thus negligible, while the third 
term dominates logn. Therefore (|14|) diverges to minus infinity and conse- 
quently (I13p holds. The proof for the case p > 2 is virtually identical and 
therefore it is omitted. D 
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The following lemma will be used in the proof of Theorem 12.2 
Lemma 4.1. Assume the conditions of Theorem \2.^ Let 

1 r^l^ 



fLi^) 



2tt 



i/h 



-itx 'remp(t) 



dt. 



Then 



as n — > 0. 



^^^/:.(-)-E[/:.(x)]^AAfo,^] 



=2A \ 



Proof . The proof is a minor variation of the proof o f Theorem 2.1 of lvan Es and Uh 
(J2004l ). The arguments of Ivan Es and Uhl (J2004l ) are applicable, because 
they only use the existence and continuity of the density q of X, which is 
still true in our case. 

Let S denote a random variable, independent of the X's and with a 
density 

fs{x) 



1 gxV(2h2)^ 



cih) 



[0,1]' 



where the normalisation constant c{h) = L e^ '^ 'dx. Furthermore, let 
En = (S — l)//i^ and let E he a standard exponential random variable. If 
for a random variable S we set 



S 



,(x,-< 



E (cosi -(X^-x) ]\Xj] =Es 



cos I — {Xj — x] 



then f*fi{x) — E [/*^(3;)] can be written as 



f*j,{x)-E[f*j,ix)] 



c(K)_ 
Tinh 



E 

i=i 



cos 



X^ — x\^ 



cos I — hE{Xj — x] 



sm 



Xj — X \ „ 



-E 
+ E 



cos 



sm 



Xj — X .^ 



Xj — X .^ 



sin ( — hE{Xj — x 
cos I — hE{Xj — x] 

sin I — hE{Xj — x) 



+ Op f^h's'/'^'^'A 



(15) 



cf. equation (31) of lvan Es and Uhl (iooi). 
A straightforward computation yields 



E, 



cos ( — hE{Xj — x] 



l + /i2(Xj -x)2 



wi{h{Xj — x)) 
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and 

Ee 
where 



sm 



-h^{Xj-x) 
wi{u) 



l + u^ 



h{Xj - x) 
and W2(u) = 



W2{h{Xj -x)), 



l + v?' 



Define the random variables Vnj as 



Vnj = cos [-^ — 



X — x^ 
wi{h{Xj — x)) — sin ( — ^— ]w2{h{Xj — x)) 



= cos{Yhj)wi{h{Xj - x)) - sm{Yhj)w2ih{Xj - x)) 
where Yhj = {Xj — x)//imod27r. Then 



(16) 



/:,(x)-E [f:,ix)] = ^ i ^{Vn,-^ [K,,])+0P (^h'e'/^'^'A . (17) 



(18) 



Note that by the inequahty \a + b\P < 2P{\a\P + \b\P),p > 0, we have 
E [{Vn, - E [K,,])'] < 16(E [V^J + (E [K,,])'). 

Since the characteristic function cpx is integrable, by IChungI ( 200ll . The- 
orem 6.2.3) the density g of X is con tinuous and bounded. Hence X sat- 
isfies the conditions of Lemma 3.1 of Ivan Es and Uhl (200J), and we have 

{hX,Yhj) ^ (0, U). It also holds that 

E [Vnj] = E [cos{Yh,j)wi{h{Xj - x)) - sm{Yhj)w2{h{Xj - x))] 
-^ E [cos{U)wi{0) - sm{U)w2{0)] = 

and 

E [V^j] = E [{cos{Yhj)wi{h{Xj - x)) - sm{Yhj)w2{h{Xj - x)))^] 

^ E [icos{U)wi{0) - sm{U)w2m^] = E [(cos(C/))^] = ^, 

o 

because the cosine is a bounded and continuous function. The asymptotic 
variance of Vnj is given by 

Var[Kj] -^ E [{cos{U)wi{0) - sm{U)w2{0))^] = E [{cos{U)f] = ^. 



It follows from ([HD that 

E[\Vn,-E[Vn,r] 



Oil) 



n(Var[Kj])2 



n(i+o(l))2 



(19) 



15 



Consequently, Lyapunov's condition with 6 = 2 is satisfied for Vn.j, and 
hence both l/n^^^,(K,,- - E [VnJ] and c{h)/{7Thn) E"=i(K,i - E [VnJ] 
are asymptotically normal. The asymptotic variance of the latter is given 
by 



Var 









which follows from Lemma 2.1 of lvan Es and Uhl ( 20041 ). This completes the 
proof of the lemma. D 

Proof of Theorem\2M Write Cn{h) = ^h-^e-^/'-^''''\ We have 

Cn(/i)(/n/«(2;)-E[/„;,(x)])=CnW(/n/.(x)-/(x)) + C„(/i)(/(x)-E[/„;.(x)]) 

= Uhmnhix) - f{x))lB„, - E[{U{x) - /(x))1b„J) 

+ Uhmnhix) - /(x))1b';, - mnhix) - f{x))lB^J), 

where the set Bnh is defined as in Section [2j Now notice that, for an arbitrary 
constant 77 > 0, by Chebyshev's inequality we have 

P{Uh)\{fnh{x) - /(x))1b„, - E[(/„,(x) - /(x))1b„J| > v) 

<lCnih)E[\fnh{x)-f{x)\lB^,]. (20) 

Since (j)f is integrable, it follows that |/(x)| < C, where C is some constant. 
It then follows that the probability at the left-hand side of (l20]) is bounded 

by 

-Cnih){Mn + C)P{Bnh). (21) 

rj 

Now we apply the bound of Proposition 12.11 to F{Bnh)- To prove that ([2T]) 
converges to zero, it is sufficient to verify that 



n 



/ieV(2/^2) 



(M„ + C) 



nf- 







(22) 



for 3/2 < p < 2. This is obviously true due to Condition 11.31 and 11.41 
The proof that (|20|) converges to zero for p > 2 is likewise straightforward. 
Therefore 

Uh){{Lh{x) - /(x))1b„, - E[iUix) - /(x))lB„J} ^ 0. 

Hence by Slutsky's theorem, see Ivan der VaartI ( 19981 . Lemma 2.8), this term 
can be neglected and it suffices to consider 

UhnUnhix) - /(x))lBe - E[{Uhix) - f{x))lB^]}. 
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We have that 



log 



</'x(i) 



-Ap-t2/2 



i.e. the real part of the distinguished logarithm 



Log 



-Ag-t2/2 



(23) 



is bounded. On the set -B^^, if S is selected small enough, (/)emp(i)e'^e* '^ is 
arbitrarily close to (/>x(Oe e* '^ and also stays bounded away from zero at 
a positive distance. Therefore 



l/h 
-1/h 



log 



VempXj') 



e-^e-*' 



dt < C-, 
h 



where C is a constant. This grows slower than M„ and hence M„ will 
eventually dominate. Now we turn to the imaginary part. Let -0 : M ^ C, 
where 

V,(t) = </>^(t)eV'/2 = e^<^/W. 

By the Riemann-Lebesgue theorem 'tp{t) converges to 1 as |f | 
there exists t* > 0, such that 



oo and hence 



Furthermore, we have 



m) - ii < 



mt)\>e- 



2' 



t > t* 



t£ 



(24) 



(25) 



Since / has a finite first moment, by I Schwartz! (Il966l . Theorem 1, p. 182) 
(pf and consequently ip are continuously differentiable. Therefore, the path 
V' : [— t*,t*] — > C is rectifiable, i.e. has a finite length. In view of this fact 
and (j25p . ip : [— t*,f*] — > C cannot spiral infinitely many times around zero 
and for \t\ > t* it cannot make a turn around zero at all because of ()24p . 
Consequently, for the same reason as we gave above for the real part of the 
distinguished logarithm, the truncation on this set becomes unimportant for 
the argument as well and we have 



Thus we have to consider 

Uh){{fnh{x) - f{x))lB^^, - E[(/„,(X) - /(x))1b. ]}. 



(26) 
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Plugging in the expressions for fnh{x) and f{x), we obtain that the above 
expression is equal to 



Cn{h)< 



1 



2ttX 



27r' 



l/h 
-1/h 

-E 

-l/h 



-itxj „„ I 4>emp{t) I ^^^ 



Log 



^x{t) 



B" 



1 



27rA./-i//, V 0x(i) ' "'^ 



l/h 



-itx. 



oo 



e -</.;(t)dt + / e-**>/(t)rft (1b^, - E [Ib^J). (27) 

oo Jl/h J 



4(ii 



First notice that 

-l/h 



-itx 



(t)f{t)dt + 



~itx 



l/h 



'■(t)f{t)dt 



< 



|(/)/(t)|dt < oo. 



Consequently, the last term in (1270 converges to zero in probability if 
This in turn is equivalent to 

CnW(lB„,-E[lB„J)^0, 

because l^c = 1 — 1b„,^. By Chebyshev's inequality it is sufficient to prove 
that Cn{h) P{Bnh) -^ 0. However, this follows from ^ and (f22]l . 

Hence, by Slutsky's theorem we have to consider the first term of ([2] 



Uh)< 



1 



2itX 



l/h 



-l/h 



-itxj ^„ I 4>emp[t) , ^^^ 



Log 



E 



1 /-v^ 



27rA 



l/h 



_B= 



-iti'^Q / 9emp{t) 



Hit) 



dtl. 



Rewrite this as 

' 1 rV^ 



Cn(/i)< 



27rA 



l/h 



E 



(px{t) J "" 

L /'^' e-*x. ( ^emp{t) _ ^ 

'ta y_i/,, V (i^x{t) 



27rA 



dtl; 



+ Cn(/l)< 



27rA 



l/h 
-l/h 



e-''''Rnh{t)dtlB^ 



E 



^/i/""'''"'*""''-]}' *'" 
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where 



Rnh{t) = Log 1 + 



(Pempy'^) 



Notice that on the set 5^^ we have 



'x(t) 

empy'') 



■'emp 



it) 



<Px{t) 



(29) 



(Px{t) 



1 



< 



if 6 is smah enough. Indeed, it suffices to choose 5 in such a way that 
e 6 < 1/2. From the inequahty | Log(l + 2;) — 2| < \z\'^, vahd for z sufficiently 
small, it follows that 

2 



\Rnh{t)\ < 



''emp 



(t) 



(Px{t) 



1 



Consequently, to prove that the second term in (|28p asymptotically vanishes, 
it is sufficient to prove that 



Uh) 



1 



27rA 



-E 



l/h 



-1/h 



\Rnh{t)\dtlB^ 



< Uh) 



27rA 



-E 



l/h 
-l/h 



■'emp 



it) 



<Px{t) 



dt 



0. (30) 



Since |(/'y(t)| "^ < e^"^, we have 



E 



l/h 
-l/h 



(pempXj') 



5X 



{t) 



dt 



< Ce'/'^'E 



\4'empit)(f>wiht) - (l)xit)4>wiht)\'^dt 



where cpy^ is the characteristic function of the sine kernel and C is a constant. 
By Parseval's identity the expectation on the right-hand side equals 



2tt 



(qnhix) - q*Whix))'^dx 



This in turn equals th e integrated varia nce of a kernel estimator g„/j, which 
is of order {nh)~^, see iTsybakovl (J2004l . Proposition 1.7). Thus we have to 
show that 

.2 1 eK^/^') 



n 



Mh-' 



0. 



/ieV(2/i2) nh h?^ 

The result follows from Condition 11.31 and can be verified by taking the 
logarithm of the left-hand side of the above expression and concluding that 
it diverges to minus infinity. We obtain 



1 



1 



— -TT — log h log n 

2/i2 ^ 2 ^ 



-oo. 
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because /i~^ = (logn)^^ and 2/5 < 1, and hence the dominating term on the 
left-hand side in the above expression is the last one. 

We deal with the first summand in (1281). Rewrite it as 



Uh). 



1 



2itX 



l/h 



1/h 



^-itx 'remp[t) ^ 

(t)x{t) "" 



E 



27rA 



^'\-a.4>emp{t) 



l/h 



(t>x{t) 

'l/h 



dtl 



B" 



We want to show that the second summand in this expression converges to 
zero in probability. Notice, that it is bounded by 

CUh)l\lB„,-E[lB„,]\, 

because l^c = 1 — 1^^^ . Here C is some constant. By Chebyshev's inequal- 
ity it is sufficient to prove that 

Cnih)^PiBnh) ^ 0. 



This is obviously true thanks to ()20p and (I2ip . 

Thus, by Slutsky's theorem, instead of (j3ip we may consider 



Uh)< 



27rA 



-itx 'remp\t) 

l/h 4'x{t) 



dtl 



B" 



-E 



1 



27rA 



^~itxVemp{t) 

e — ] — 7-^atifjc 



l/h 



(t)x{t) 



Note that for |t| < l/h 

\<t^x{t)\ 



-tV2p-A+A0^(t) 



> e-l/(2h2)g-2A 



holds. Consequently, we have 



Cn(/i)E 



27rA 



l/h 
-l/h 






it) 



1 2 



<Uh)-^U'^(''^^e''nBnh), (32) 
27rA h 



which converges to zero thanks to the fact that P{Bnh) = 0{n^ ^) for 
3/2 < p < 2, and P(S„/,) = 0(n~^/2) for p > 2, see Proposition EH 
Hence by Slutsky's theorem we may consider 



Uh)< 



2ttX 



-l/h (t)x{t) 



dt-E 



2ttX 



-l/h (t)x{t) 



dt 
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By ([T]) the expression above can be rewritten as 



e^ 1 Z"^' 






dt 



^'■■""''ts? 



jA 1 [llh 



-itx 



-1/h 



<t>emp{t) 4>x{t) 



-\<t>f{t) 



1 dt. (33) 



By Leninia [4. li the first sumniand in this expression is asymptotically normal 
with zero mean and variance given by cr^ = e^ /(27r^A^). 

Now we will show that the second term in (j33p asymptotically vanishes 
in probability. By Chebyshev's inequality it suffices to show 



iUh)fE 



e^ 1 
T2^ 



^^^ -itx ( (t>emp{t) _ <hdt)\ ( X4>f{t) 



l/h 



(Cn(/i))'Var 



e-*V2 

l/h 



l\dt 



-itx Yemp 



it) 



l/h 



-t2/2 



(e-^^/W - l) dt 



Using the independence of the Xj's, after further simplification we obtain 
that we have to prove that 



1 



Thus we have to prove that 
1 



l/h 
-l/h 



gi2/2|g-A0^-(t) 



lldt 



heW^'-') y_i/. 



l/h 



f/2.-Mf{t) _ ii^i 



0. 



From Ivan Es et alj (|2007al ) we have 

where the constant C\ depends on A only. Therefore it suffices to prove 

fi//i 



1 ri'' 

'h^^^)Li/h 



e* l''\4>f{t)\dt 



0. 



(34) 



This can be done either via an application of L'Hopital's rule or via the 
metho d similar to the one used in the proof of Lemma 5 in Ivan Es and Uh 
( 20051 ). We follow the latter path. It is enough to consider the integral over 
[0, l//i] as the integral over [— l//i, 0] can be dealt with in a similar fashion. 
After the change of integration variable v = (\ — ht)/h?' , we obtain 



h 



1/^2 



e 2h^ 



"</•/ 



h 



dv = he2h^ 



l//l2 



-(j)f 



h 



dv. 
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By the Riemann-Lebesgue theorem hmui^oo 0/(^) = 0) ^^^^ therefore by the 
dominated convergence theorem the above expression is of order o{he^'^ '). 
The dominated convergence theorem is appHcable, because 

(e ' -e ^ ' )l[0,i/h2] > 

and hence e~^' ^ can be taken as the dominating function. Consequently ()34p 
vanishes as /i ^ and this argument concludes the proof of the theorem. 

D 

Proof of Proposition \2.2[ We will prove both parts of the statement simul- 
taneously. Write 

E [fnhix)] - fix) = E iiflf^ix) - /(x))1b„J + E [(/„,(x) - f{x))lB^J. (35) 

Notice that for some C > 0, 

E[{fnh{x) - f{x))lB,j\ < {Mn + C)F{Bnh). (36) 

Here we used the fact that / is bounded, because (/>/ is integrable. Due to 
Theorem 12. 1^ Corollary 12.11 and Conditions 11.31 and 11.41 we see that (I36p 
converges to zero as n ^ oo. Moreover, this term is negligible compared 
to h"-'^e-^/^" (case (i)) or h^-'^ (case (ii)), since P{Bnh) = 0{n^-f) or 
P{Bnh) = 0{n~f''^), depending whether l<p<2orp>2, see Proposition 
ED 

Now we turn to the second summand in (j35p . By selecting 5 small 
enough, on the set 5^^ truncation in the definition of fnh{x) becomes unim- 
portant, see the arguments that led to ([26]) . Hence we have to deal with 
E [[fnhix) — /(x))1bc^]. Using expressions for fnhix) and /(x), we see that 
this term equals 



E 



1 /"'^"e-t-Logf^^^ldtl, 



2vrA J_iih V (t^xit) 



e-^'-^Mt)dtPiB^^,) - ^ y^ ^ e-*>/(t)dtP(i?;j,). (37) 

The last two terms in this expression can be treated similarly and therefore 
we consider only the second one. It will turn out that these are the leading 
terms in the bias expansion. Notice that 



1 f°° 

-/ e-^'^c^fit)dtPiB^^,)^0 



2vr .ii/h 



as n ^ 0, because 4>f is integrable. Moreover, if 4>fit) = O [e I*'") , a > 1, 
then 

r \(l)fit)\dt = O (h''-^e-^/'''^y 
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This f act can be proved using the same type of arguments as in lCasella and Berger 
(|200l Example 3.6.3, p. 123). Furthermore, if 4>f{t) = O (|t|-^) , then 



E 



\(l)f{t)\dt < c / r^dt = 0{h^-^). 

l/h Jl/h 



Now we turn to the first term in (|37p . Rewrite it as 
1 /■^/'^ 



2ttX 



-l/h 



-itx ( 9emp[t) ^ 



dtl 



B" 



E 



1 /"i/^ 



2ttX 



l/h 



e'''''Rnh{t)dtlB^ 



(38) 



where Rnh{t) is defined as in (j29p . Consider the first term in this expression. 
Rewrite it as 



E 



1 /■^/'^ 



27rA 



-l/h V '/'x(t) 



dtl 



B" 



E 



27rA 



e **^' ( ^^^2!!M^ _ 1 ) Jt 



E 



l/h 

^-ttx I I 

-l/h \ 4>x{t) 

1 r^/'' 



27rA 



-l/h V 'Ax(i) 






The first summand here is equal to zero. As far as the second summand is 
concerned, notice that 



l/h 

-l/h 



e ^*" ( ^^^^ - 1 Mi 



<Px{t) 



h 



where C is some constant. This inequality follows from the facts that for 

te[-i/h,i/h], 



'^emp 



(t) 



<t>x{t) 



< 



-'emp 



it) 



•}x 



it) 



+ 1, 



(Pemp\t) 



'}X 



it) 



< e2^e^/(2h2) 



because (pxit) = <Ay(*)e * '^ and |</'y(i)| > e '^^. Consequently 



E 



1 f^/^ 



27rA 



e -**" ( ^^^SUM^ _ 1 1 dtlB 



-l/h 



^xit) 



h 



nh) 
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This term will converge to zero as n — > oo due to Theorem 12.11 and Proposi- 
tion [2TTJ Moreover, due to the same facts, it is negligible compared to K^"^ 
or to/i^-^e-i/'*". 

Now we consider the second term in (|38p . Notice that this term is of 
order {nh)~^, which was shown in the proof of Theorem l2.2l in the arguments 
concerning (I30p . Consequently it will be negligible compared to h'^~^ or to 
j^a-i^-i/h" ^ This completes the proof of the proposition. D 

Proof of Remark \2.5[ We have to study the behaviour of 

C„(/i)/i"-^e~i/'*". (39) 

After taking the logarithm, we obtain 

Dominating terms here are the second, the fourth and the last one. Now note 
that the fourth and the last terms equal — (l/2)(log?7-)^^ and — (logn)°^, 
respectively. In view of 2/3 < 1 and a/3 < 1, these terms are dominated by 
logn and hence (flO]l diverges to plus infinity. It follows that so does ([391) . 
The case of Cnih)h'^^^ ^ cxd is trivial given Condition 1 1.3[ D 
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